Bag of Peaks: interpretation of NMR spectrometry
Abstract
MOTIVATION: The analysis of high-resolution proton nuclear magnetic resonance (NMR) spectrometry can assist human experts to implicate metabolites expressed by diseased biofluids. Here, we explore an intermediate representation, between spectral trace and classifier, able to furnish a communicative interface between expert and machine. This representation permits equivalent, or better, classification accuracies than either principal component analysis (PCA) or multi-dimensional scaling (MDS). In the training phase, the peaks in each trace are detected and clustered in order to compile a common dictionary, which could be visualized and adjusted by an expert. The dictionary is used to characterize each trace with a fixed-length feature vector, termed Bag of Peaks, ready to be classified with classical supervised methods.

RESULTS: Our small-scale study, concerning Type I diabetes in Sardinian children, provides a preliminary indication of the effectiveness of the Bag of Peaks approach over standard PCA and MDS. Consistently, higher classification accuracies are obtained once a sufficient number of peaks (>10) are included in the dictionary. A large-scale simulation of noisy spectra further confirms this advantage. Finally, suggestions for metabolite-peak loci that may be implicated in the disease are obtained by applying standard feature selection techniques.
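The training pipeline described in the abstract (detect peaks, cluster them into a common dictionary, encode each trace as a fixed-length vector) can be illustrated with a minimal sketch. The abstract does not specify the peak detector, the clustering method, or the feature values, so everything below is an assumption made for illustration: the function names `detect_peaks`, `build_dictionary`, and `bag_of_peaks`, the local-maximum detector, the gap-based 1-D clustering, the `min_height` and `tol` parameters, and the choice of the strongest nearby peak intensity as the feature value. It is not the authors' implementation.

```python
from typing import List, Tuple

Peak = Tuple[float, float]  # (position in ppm, intensity)


def detect_peaks(ppm: List[float], trace: List[float], min_height: float) -> List[Peak]:
    """Return local maxima at or above min_height (assumed, simplistic detector)."""
    peaks = []
    for i in range(1, len(trace) - 1):
        is_local_max = trace[i] > trace[i - 1] and trace[i] >= trace[i + 1]
        if trace[i] >= min_height and is_local_max:
            peaks.append((ppm[i], trace[i]))
    return peaks


def build_dictionary(all_peaks: List[List[Peak]], tol: float) -> List[float]:
    """Cluster peak positions pooled over all training traces.

    Assumed rule: a sorted position within tol of the previous position joins
    the same cluster; each cluster mean becomes one dictionary entry.
    """
    positions = sorted(pos for peaks in all_peaks for pos, _ in peaks)
    clusters: List[List[float]] = []
    for pos in positions:
        if clusters and pos - clusters[-1][-1] <= tol:
            clusters[-1].append(pos)
        else:
            clusters.append([pos])
    return [sum(c) / len(c) for c in clusters]


def bag_of_peaks(peaks: List[Peak], dictionary: List[float], tol: float) -> List[float]:
    """Fixed-length vector: strongest peak intensity near each dictionary entry."""
    features = [0.0] * len(dictionary)
    if not dictionary:
        return features
    for pos, height in peaks:
        # Assign the peak to its nearest dictionary entry, if close enough.
        j = min(range(len(dictionary)), key=lambda k: abs(dictionary[k] - pos))
        if abs(dictionary[j] - pos) <= tol:
            features[j] = max(features[j], height)
    return features


# Toy data (invented for illustration, not from the study).
ppm = [1.0, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6]
trace_a = [0.0, 0.2, 1.0, 0.2, 0.0, 0.5, 0.0]
trace_b = [0.0, 0.9, 0.3, 0.0, 0.0, 0.0, 0.0]

peaks_a = detect_peaks(ppm, trace_a, min_height=0.1)
peaks_b = detect_peaks(ppm, trace_b, min_height=0.1)
dictionary = build_dictionary([peaks_a, peaks_b], tol=0.15)
vector_a = bag_of_peaks(peaks_a, dictionary, tol=0.15)
vector_b = bag_of_peaks(peaks_b, dictionary, tol=0.15)
```

Both toy traces end up with vectors of the same length as the dictionary, regardless of how many peaks each trace contains. Vectors of this kind could then be passed to any classical supervised classifier, and the dictionary entries remain interpretable as peak positions that an expert could inspect or adjust.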
Journal title:
- Bioinformatics
Volume 25, Issue 2
Pages -
Publication date 2009